NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SoK: Fair Clustering: Critique, Caveats, and Future Directions

https://doi.org/10.1109/SaTML64287.2025.00044

Dickerson, John; Esmaeili, Seyed A; Morgenstern, Jamie; Zhang, Claire Jie (April 2025, IEEE)

Free, publicly-accessible full text available April 9, 2026
Doubly Constrained Fair Clustering

Dickerson, John; Esmaeili, Seyed A; Morgenstern, Jamie; Zhang, Claire Jie (December 2023, NeurIPS 2023)

Full Text Available
Fair Labeled Clustering

https://doi.org/10.1145/3534678.3539451

Esmaeili, Seyed A.; Duppala, Sharmila; Dickerson, John P.; Brubach, Brian (August 2022, KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Fair Clustering Under a Bounded Cost

Esmaeili, Seyed A.; Brubach, Brian; Srinivasan, Aravind; Dickerson, John P. (December 2021, Proc. Conference on Neural Information Processing Systems (NeurIPS))

Clustering is a fundamental unsupervised learning problem where a data-set is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its group membership and requires that each color has (approximately) equal representation in each cluster to satisfy group fairness. In this model, the cost of the clustering objective increases due to enforcing fairness in the algorithm. The relative increase in the cost, the “price of fairness,” can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives: the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective which generalizes the group egalitarian objective. We derive fundamental lower bounds on the approximation of the utilitarian and egalitarian objectives and introduce algorithms with provable guarantees for them. For the leximin objective we introduce an effective heuristic algorithm. We further derive impossibility results for other natural fairness objectives. We conclude with experimental results on real-world datasets that demonstrate the validity of our algorithms.
more » « less
Full Text Available
Probabilistic Fair Clustering

Esmaeili, Seyed; Brubach, Brian; Tsepenekas, Leonidas; Dickerson, John (December 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
Fair Clustering Under a Bounded Cost

Esmaeili, Seyed; Brubach, Brian; Srinivasan, Aravind; Dickerson, John P (January 2021, Advances in Neural Information Processing Systems 34)

Clustering is a fundamental unsupervised learning problem where a dataset is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its group membership and requires that each color has (approximately) equal representation in each cluster to satisfy group fairness. In this model, the cost of the clustering objective increases due to enforcing fairness in the algorithm. The relative increase in the cost, the `''price of fairness,'' can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives: the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective which generalizes the group egalitarian objective. We derive fundamental lower bounds on the approximation of the utilitarian and egalitarian objectives and introduce algorithms with provable guarantees for them. For the leximin objective we introduce an effective heuristic algorithm. We further derive impossibility results for other natural fairness objectives. We conclude with experimental results on real-world datasets that demonstrate the validity of our algorithms.
more » « less
Full Text Available
An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm

DeCarolis, Christopher; Ram, Mukul; Esmaeili, Seyed A; Wang, Yu-Xiang; Huang, Furong (January 2020, International Conference on Machine Learning)

Full Text Available

Search for: All records